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Abstract 
> 

We propose a new scheme for increasing the throughput of video files in cellular communications 
^" 

systems. This scheme exploits (i) the redundancy of user requests as well as (ii) the considerable storage 

capacity of smartphones and tablets. Users cache popular video files and - after receiving requests from 



other users - serve these requests via device-to-device localized transmissions. The file placement is 
optimal when a central control knows a priori the locations of wireless devices when file requests occur. 
However, even a purely random caching scheme shows only a minor performance loss compared to 
such a "genie-aided" scheme. We then analyze the optimal collaboration distance, trading off frequency 
reuse with the probability of finding a requested file within the collaboration distance. We show that an 
improvement of spectral efficiency of one to two orders of magnitude is possible, even if there is not 
very high redundancy in video requests. 

I. Introduction 

Wireless video has been one of the main drivers of wireless (cellular) data traffic, and its 
importance is going to grow. One of the main drivers of such traffic will be mobile video. 



Originally, wireless video mostly implied short video clips (YouTube or news channels) on the 
very small screens of smartphones. The recent popularity of tablets and large-screen phones has 
opened the possibility for watching feature-length movies at high resolution on mobile devices. 
This increases the amount of data that has to be transmitted, and leads to predictions that video 
will soon account for the majority of all wireless data traffic. According to recent industry 
estimates [ 1 1 the traffic generated by video delivery requests will quickly outpace mobile web 
content and lead to an increase in wireless data traffic by two orders of magnitude. These 
developments, while enabling new business models and possibly consumer satisfaction, threaten 
to completely clog up the already overburdened cellular networks. 
Traditional methods for increasing cellular capacity are 

• Improvement of the physical-layer link capacity between transmitter and receiver. However, 
fourth-generation cellular systems (LTE- Advanced) and wireless LANs (IEEE 802.11n/ac), 
use a physical layer (MIMO-OFDM with near-capacity-achieving codes) whose spectral 
efficiency is close to the theoretical limits, so that further improvements will be very limited. 

• Use of additional spectrum. In the best case we can increase the available spectrum by a 
factor of 2, which is insufficient to satisfy the increased demand for data. 

• Decrease of the cell size to improve the area spectral efficiency. While promising, this 
approach suffers from the high costs of establishing new cell sites and providing the 
associated backhaul capacity Q. 

In this paper, we propose and analyze a novel architecture to improve the throughput of video 
transmission in cellular networks, based on (i) caching of popular video files in cellphones and 
(ii) base station controlled device-to-device (D2D) communications. Our architecture exploits 
the large storage available on modern smartphones to cache video files that might be requested 
by other users. Base stations keep track of the cache content and direct requests to the nearest 
smartphone that has the desired file, which is then transmitted via a D2D link. Since the distance 
between the requesting user and the user with the stored file will be small in most cases, multiple 
D2D links can be operated on the same time/frequency resources within one cell. This in turn 
leads to a dramatic increase in spectral efficiency. 



Existing Literature: Exploiting the redundancy of video requests in cellular networks is not 
new, but in the past has been used in fundamentally different ways compared to our approach. 
Wireless TV, such as MediaFlow, was exploiting the broadcast effect to supply the same video 
stream to many users simultaneously. However, consumers expect on demand capability for 
cellular transmission. Caching of popular files at the base stations, or at mobile switching centers 
is discussed, e.g., in [3], but it only helps to reduce the strain on the cellular backhaul, without 
alleviating the "on air" congestion. However, the efficiency of this approach is limited by the 
number of files that can be stored on a single device. A pioneering precoding scheme that works 
in conjunction with broadcast from the BS was recently introduced by [Hj. Caching files that 
are anticipated to be useful for a particular user on his/her personal device is another popular 
approach that was pioneered for set top boxes and has recently been applied to mobile devices 
[|5J. In contrast, we will present an approach that allows many devices to pool their caching 
resources, and exchange their stored files locally. This creates a much larger "virtual" cache, and 
thus dramatically increases the probability of finding the request files in the virtual cache. [] 

Also the use of D2D communications is not new. The literature on ad-hoc networks without 
central control is particularly rich, and we refrain from even attempting to give an overview of all 
the existing papers. Standardized commercial systems have recently been introduced (WiFi Direct 
(8j), or have been proposed (FlashLinQ [9]). The use of base- station controlled D2D has recently 



gathered the interest of the cellular industry, and has been reviewed, e.g., in [10|. The central 
knowledge of the channel states and information requests has been shown to greatly improve the 
performance. However, these systems aim to establish direct communication between arbitrary 
users (e.g., friends that want to chat to each other), and thus cannot influence the distance that 
needs to be covered. Our proposed system allows the BS to influence which nodes communicate 
with each other, leading to a completely different mathematical formulation and performance 
results. In particular, we can show that the throughput in our system is not limited by the 



Gupta- Kumar" limit [11 1. A somewhat different approach was recently introduced under the 



'An alternative, complementary, approach to dealing with the backhaul bottleneck was recently proposed in our recent prior 
work [6 1, [7 1, where femto-base stations are replaced by small base stations with high storage capacity but low backhaul capacity. 



name "microcast" by Keller et al. [12|, where multiple devices download from the BS and then 
combine the gleaned information. The difference is that micro casting speeds up the download 
for a particular user, but does not necessarily increase the sum throughput of the users in the 
cell, while our approach leads to an overall spectral efficiency increase. 

Main Contributions: One of the contributions of this paper is the introduction of a new 
paradigm for wireless video content dissemination with no additional infrastructure cost q] Our 
approach is based on caching popular video files in mobile devices, where groups of mobile 
devices create large "virtual" caches, in which file duplication is avoided as much as possible. 
The stored files are transmitted, upon request, to a user requesting a particular file, using highly 
spectrally efficient D2D transmission. 

As a second contribution, we provide an approximate analysis of such a system. The analysis 
is based on a subdivision of a macrocell into virtual clusters, such that one D2D link can be active 
within each cluster. The cluster size is a key parameter of the system, and can be controlled and 
optimized by the transmit power of the mobile terminals. We investigate two possible caching 
schemes, the first has a central control of all devices (and a-priori knowledge of where devices 
will be when files are requested); the second is based on random caching where each device 
stores files without any central control. We also provide a simulation evaluation section based 



on real- world video popularity distributions taken from [15|. From these results, we find that 
improvements of the video throughput by one to two orders of magnitude are possible. 

The remainder of the paper is organized as follows: Section [n] describes our new architecture 



and its mathematical model. Section III analyzes the optimal placement of files in the cache. 



Section [TV] derives the optimal cluster size (transmission distance between nodes). Section M 



analyzes the overall performance of the scheme, and Section VI provides conclusions and an 
outlook for future studies. 

2 We first suggested the concept in a submission for the Intel VAWN funding program jl3) and in the conference version of 
this paper, 1141. 



II. A NEW ARCHITECTURE FOR CELLULAR VIDEO 

Our new approach to solving the video bottleneck in cellular systems is based on the following 
two key observations: a large amount of video traffic is caused by a few, popular, files and 
storage is a quantity that increases faster than any other component in communications/processing 
systems: 

• The popularity of video files is very unevenly distributed. "Viral" YouTube videos, movies 
that are newly available for rental, and reports from recent sports events are typical examples 
of highly popular videos, which account for a considerable percentage of all video traffic. 
In current cellular networks, the video is downloaded by each requesting user via the base 
station of the cellular network, which wastes precious spectral resources. On the other hand, 
the redundancy cannot be exploited by a simple broadcasting scheme, because consumers 
want to view video on demand, and not be bound by predetermined starting times. 

• Storage space is the fastest-growing quantity on communication devices, outpacing the 
growth in data rate, and even Moore's law. In particular, recent years have seen an enormous 
proliferation of smartphones and tablets that have anywhere between 10 and 64 GByte 
of storage (not to mention the 500 GByte on typical laptop harddisks). This storage is 
commonly under-utilized. 

Users request video files according to an (empirical) distribution that reflects the popularities of 
the files. This distribution changes slowly (over the course of days or weeks), reflecting, e.g., the 
popularity of particular YouTube videos, recently released movies, etc. Thus, the popularity of 
files can be learned and predicted. However, caching of the most popular files at each wireless 
device is not effective, since the storage space at one particular node is insufficient to store more 
than a few popular files. We suggest that groups of mobile devices collaborate to exchange files 
via D2D communications. We can say that clusters of collaborating devices "pool" their caching 
resources to provide a "central virtual cache" (CVC). 

Figure [T] illustrates the overall architecture of a single cell with a BS and multiple users. 
Dashed circle shows a group of users that communicate with each. We divide the macrocell 
into groups or "clusters" of nodes (indicated by dashed circles) that can exchange information 




Fig. 1 . An example of the single-cell layout. 



with each other, constituting the virtual caches. Each of the devices store files according to a 
specific caching rule - the choice of this rule will be discussed below. For example, without 
loss of generality, we assumed that user i stores file Fj. Users request files from the virtual 
cache. If the video file is found in the part of the virtual cache that is located on the users own 
device, (e.g., user 1 which requests F{), then the file is available immediately (we call this case 
a "self -request" - it poses the least demand on the wireless resources). If the file is found in the 
storage of another device in the group, i.e., another part of the virtual cache, then the file is sent 
through D2D communications to the requesting user, for example, in Figure [T] user 5 requests file 
F 4 which is stored in cache of user 4, thus a D2D communication will be established from user 
4 to user 5. Such D2D communications occur over a short distance, and are highly spectrally 
efficient: (i) short distance implies high SNR, so that a high rate can be achieved for such a 
link, and (ii) short-distance communications allows high spatial reuse, which implies high area 
spectral efficiency. 

We assume in this paper that all nodes use the same transmission power (i.e., there is no 
power control), but this common power can be optimized. The power determines the feasible 
communication range, and thus the cluster size. Finding its optimal value involves a trade-off 



between two counter-acting effects: (i) a small cluster size means a high spatial reuse, i.e., more 
D2D links can be operated simultaneously within one cell, (ii) a larger cluster size increases the 
probability that a user actually finds the file it requests in its vicinity. Thus a D2D link can be 
meaningfully established and the corresponding cluster is called active. 

The D2D communications is controlled by the BS: when a user requests a certain video 
file, the BS can inform it about the nearest device that has the file stored, and the file is then 
transmitted from that device via a D2D link. Since the BS has knowledge of all the links that 
can be scheduled, as well as the channel state information, it can optimize the frequency reuse 
between the devices. The distance between transmitting and receiving device is much shorter 
than between device and base station. Thus multiple D2D links can be operated on the same 
time/frequency resources within one cell. 

If a requesting device does not find the file in its neighborhood or in its own cache, it obtains 
the file in the traditional manner from the BS. We assume that the D2D communication does 
not interfere with communication between the BS and users. This assumption is justified if the 
D2D communications occur in a separate frequency band (e.g., WiFi). For the D2D throughput, 
we henceforth do not need to consider explicitly the BS and its associated communications. 

III. Model and Setup 

Assume a cellular network where each cell/base station (BS) serves n users. For simplicity 
we assume that the cells are square, and we neglect inter-cell interference, so that we can 
consider one cell in isolation. Users are distributed uniformly in the cell, and can communicate 
with each other through D2D communications, as well as with the BS through traditional 
cellular (uplink/downlink) communication. Each user device has a cache in which video files 
can be stored. For rotational simplicity, we assume that each user stores exactly one file, though 
generalization to multiple files per user is trivial. The BS is aware of the stored files and channel 
state information of the users and controls the D2D communications. 

For our theoretical derivations, we use simplified description of the physical layer, namely the 
protocol model, widely used in networking: two users can communicate if their physical distance 



is smaller than some collaboration distance r (TTJ, [ |T6| |. The maximum allowable distance for 
D2D communication r is determined by the power level for each transmission. Conversely, we 
assume that (given that node u wants to transmit to node v) any transmission within range r from 
receiver v can introduce interference for the u — v transmission. Thus, they cannot be activated 
simultaneously. 

We assume that all D2D links share the same time-frequency transmission resource within 
one cell area. Multiple transmissions on those resources are possible since the distance between 
requesting users and users with the stored file will typically be small. Furthermore, there should 
be no interference of a transmission by others on an active D2D link. To achieve this, the cell is 
divided into smaller areas called cluster, see Figure [2} All clusters are assumed to be square with 
equal area. As can be seen from figure [2} the cell side is normalized to 1 and cluster side is equal 
to r. We call r the collaboration distance. Users in each cluster can communicate with each other 
locally; however to avoid intra-cluster interference, only one such communication per cluster is 
allowed. Interference to other clusters is assumed to be negligible, or can be made negligible 
through an appropriate frequency reuse scheme. Clearly the physical-layer communication model 
is oversimplified, as it does not account for the inter-cluster interference, the fact that the pathloss 
coefficients might be different in different parts of the cell, and fading. However, our results 
indicate that it captures a fundamental tradeoff in D2D collaboration and as shown in our 
simulations, can give very high gains. Furthermore, we will demonstrate in Section [V] the results 
that are closely aligned with those of a more sophisticated channel and interference model. In 
other words, when we will take into account pathless, interference and shadowing, the optimal 
collaboration distance that maximizes the rate is very close to that in the simplified physical 
layer model employed in this section. 

We assume that users may request files from a "library" of size m. Each user requests a file 
from the library independently, according to a given popularity distribution. Based on numerous 
studies, Zipf distributions have been established as good models to the measured popularity of 



video files [15|, [17|. Under this model, the frequency of the z-th popular file, denoted by ft, is 



inversely proportional to its rank: 






f t = ^p—, l<i<m. (1) 

y — 

J = l 

The Zipf exponent 7 r characterizes the distribution by controlling the relative popularity of files. 
Larger 7,. exponents correspond to higher content reuse, i.e., the first few popular files account 
for the majority of requests. 

Thus, in order to enable D2D communication it is not sufficient that the distance between 
two users be less than r; users should also find their desired files locally in the CVC of their 
cluster, i.e., stored by one of the devices in the cluster. A link between two users will be called 
potentially active if one requests a file that the other is caching. Therefore, the probability of 
D2D collaboration opportunities depends on what is stored and requested by the users. The 
decision of what to store can be taken in a centralized or distributed way, called deterministic 
and random caching, respectively. 

• Deterministic caching gives the optimal cache configuration. However, it is only feasible 
if the location of the nodes and the Channel State Information (CSI) is known a priori, 
and remains constant between the filling of the cache and the actual file transmission. The 
filling of the cache can take many hours or days, such an assignment is really only feasible 
for fixed wireless devices. Thus we consider the deterministic caching mostly as an upper 
bound for the achievable performance. Furthermore, finding the optimal deterministic file 
assignment is not trivial. In a general random geometric graph, it can be shown [18] that 
finding the optimal deterministic file assignment is NP-Complete even when users are static 
and interference is ignored. However, in a clustering-based caching, we can find a simple 
method for centralized file placement. Given that there are k users in a virtual cluster, each 
of k users should store one of the k most popular files without repetition. 

• Random caching can be implemented when users are highly mobile. Each user caches files 
at random and independently, according to a caching distribution. Specifically, we make 
a heuristic choice for this caching distribution, namely a Zipf distribution with exponent 



10 



r(n) 






R 






O O 


o o 


° O 


o ° 


o ^ o 


Oo 




o 


o ^ 


o o o 


o 


o ° 


o 


• • 


• • 


• 


• • 


User O 


o ° o 


o 


o _ o 


' ° o 


©° 


o 


• • 


• •• 


o o o 


• •• 


° ° 


• • 


Cluster 


o • 


o • 


o o 


o 


~ o 


o 


- • 


• • 

1 


• • 


• • 



Fig. 2. a) Dividing cell into virtual clusters, b) In the worst case, a good cluster can block at most 16 clusters. In the dashed 
circle, receiving is not possible and in the solid circle, transmission is not allowed. 



7 C q The exponent of caching distribution is one of our decision variables which is not 
necessarily equal to 7,.. Thus, in this strategy we find the optimal r and 7 C to maximize the 
number of D2D links. Since the BS does not influence which user stores which file, and 
the presence of users in a particular cluster is random, there may be overlap of the cache 
content at different users. Such a duplication makes the CVC less efficient. Still, we will 
see in Section W\ that the performance loss compared to the deterministic caching is small. 

IV. Finding the optimal collaboration distance 
A. Deterministic caching 

In this section, we find the optimal r for given values of size of the library m and the number 
of users in the cell n when the users store files based on the deterministic caching strategy. In 
this strategy if there are k users in a virtual clusters, each of them should store one of the k 
most popular files without repetition. 

In the subsequent formulation we assume that two neighbor clusters can have D2D links 
simultaneously and we ignore the inter cluster interference, so the number of D2D links is equal 



3 For a slightly different deployment model, and the limit of large number of nodes, [ 19 1 recently derived a "waterfilling-type" 
optimum caching distribution. 
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to the number of active clustersjjln our simulation results, we justify this assumption by showing 
that the effect of interference is negligible. 

We define a binary random variable Oj for cluster j such that aj is equal to 1 if the cluster j 
is active; otherwise, it is equal to 0. The total number of active clusters in the cell, denoted by 
A, equals to: 

A = J>. (2) 

3 

Since a,j is a binary random variable, the expectation of a, is the probability that cluster j is 
active, i.e., D2D communication takes place in cluster j. Since users are uniformly distributed 
in the cell and all clusters have equal area, the expectation of dj, denoted by E[aj], does not 
depend on j, i.e., E[a,j] = E[a] for any j where E[a] is the probability that any cluster is active. 
Thus, from ([2]), the expected number of active clusters is given by: 

E[A] = Y J E[a j \ = ^E[al (3) 

3 

where \ is the number of clusters in the cell. The probability that a cluster is active depends 
on number of users in the cluster which is denoted by K. Therefore, E[a] can be written as: 

n 

E[a] = ^2E[a\K = k]Pr[K = k], (4) 

fc=0 

where E[a\K = k] is the probability that a cluster is active provided that there are k users in the 
cluster. Pi[K = k] is the probability that the number of users in the cluster is k. The number of 

2 2 

users in the cluster is a binomial random variable with parameters n and y, i.e., K = B(n, y). 

2 

Note y is the ratio of the cluster area to the cell area. The probability that there are k users in 
the cluster equals to: 

4 If we use guard distances to avoid inter-cluster interference, the number of active clusters is scaled by this constant "frequency 
reuse factor" [20|, Chapter 17. 
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P T [K = k]=\ H I (r 2 /2) fc (l-r 2 /2) n - fc , (5) 



k 



where j I = (n ™L m ■ The expectation E[a\K = k], defined in (4), is the complement of 

the probability that no D2D communication takes place in the cluster. D2D communication is 
possible if at least one of k users in the cluster can access to its requested file in the cache of 
other users. Thus, E[a\K = k] can be written as: 



E[a\K = k] = 1 - Pr[ui = 1 n u 2 = 1 n ... n u k = 1]. (6) 

The Ui for i = 1, ..., k is a binary random variable that is 1 if the user i cannot find its requested 
file in the neighbors' caches in the cluster, i.e., in the CVC excluding the file in zth user's cache. 
According to our assumptions in this section, given k, the CVC in the cluster is deterministic. 
Since users' requests are independent from each other, the random variables u,- t and Uj for i ^ j 
are also independent. Thus, we can simplify Eq. ([6]) as: 

k 

E[a\K = k] = 1 - Y[ Pr[ui = 1]. (7) 

»=i 

Without loss of generality, we assume that user i caches the ith most popular file. Note further- 
more the possibility of self-requests, i.e., a user might find the file it requests in its own cache. 
In this case clearly no D2D communication will be activated by this user. Accounting for these 
self-requests, 

Pi[u l = l] = l-(P CV c(k)-f l ), (8) 

where f is the request probability of the ith popular file and is given in ([!]). Pcvc(k) is the 
probability of hitting the CVC and is given by 

k 

Pcvc(k) = Ys^ ±<k<m, (9) 
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It is obvious that Pcvc(k) for fc > m is equal to 1. From §7§ and ([8}, we find: 

k 

E[a\K = fc] = 1 - JJ(1 - (Pcvc(k) - /,)). (10) 

Substituting #[a|lf = fc] in Q, we get: 

n / k \ 

£[a] = ^ 1 - JI(1 " (JW*) - /,)) Pr[K = k], (11) 

fc=0 V i=l / 

where Pr[i^ = fc] is given in ^. From ([3]), ([4]), and (11), the expected number of active clusters 
can be written as: 



9 . \ 
E[A] = —^E[a\K = k}Yi[K = k} (12a) 

fc=0 



2 



n / k \ 

^ ( 1 - H(l - (i^ yc (A:) - /,)) Vi[K = k). (12b) 



fc=0 \ i=l 



r ', — '_ v T": I 



Notice that Pi[K = k] is a function of r. Thus, E[A] is a function of an unknown variable r. 
To find the maximum expected number of active clusters or equivalently the maximum average 
spectral efficiency, we should take a derivative of E[A] with respect to r. While finding r opi 
analytically in closed form does not seem feasible, numerical solutions are possible with very 
low effort, as we require a root search of a function of one variable within the interval < r < 1. 
In this optimization problem, the self-requests do not play any role. However, some users can 
get files with zero delay because of self -requests. This will suggest to consider an alternative 
criterion, minimization of the download time as follows 

minimize (n - n self - E[A])u B s + E[A]ui D2 d, (13) 

where n se if = \ X1a-=o Pcvcik) Pr[X = k] is the average number of users that get their desired 
files from their own cache with zero delay. UD2D and ubs are the average download time through 
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D2D communications and the BS, respectively. Thus, the average total delay of all users equals: 

D = (n- n se if - E[A])u)bs + E[A}u D2 d- 



B. Random caching 

In this strategy, we again assume that the BS divides the cell into virtual clusters with size 
r. Our goal here is to maximize the average number of clusters, but now under the assumption 
of random caching. The first steps are similar to previous subsection. The expected number of 
active clusters and the probability that a cluster is active are given by Q and Q. The probability 
that a cluster is active when there are k users in the cluster is stated in (|7]). 

However, for the next step, we cannot directly apply the approach used in the previous 
subsection, since with a random caching strategy, Ui and Uj for i ^ j will be dependent random 
variables. This can be understood as follows: if user i requests a file according to the some 
popularity distribution and cannot find it in the CVC, it implies an increased probability that 
the current content of the the CVC does not contain (some) popular files. So, if user j requests 
a file independently from user i but according to the same popularity distribution, it is more 
probable that user j also cannot find its requested file in the CVC in the cluster. 

Assume now that users {1,2,..., k} in a cluster respectively store the files H = [hi, h 2 , ■ ■ ■ , h k ], 
then 

PrK = l]= J2 t 1 - ( E -^ - -M Pr ^l' < 14 > 

He{l,...,m} k hj£H 

where H = U^ =1 {/ij}. Since users store files independently, the probability that users in the 
cluster store files H, Pr[H], can be written as 

k 

Pr[H] = \[ Vhj (15) 
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where p h . is the probability that user j stores file hj, more precisely 

1 
Ph J = -£^, l<hj<m. (16) 

y — 

8=1 

As stated before, we assume that users store files according to Zipf distribution with exponant 
7 C . Using @, Q, ([7]) and ( fl4| ), the expected number of active clusters is given by 



x(i-II E [>-(£/* -/*)]*[*] 



fe 

*=1 He{l,...,m}* hjGH 



E[A] is a function of r and 7 C . The optimal r and 7 C can be found numerically using the above 
equation. Note that the computation of this probability involves a summation over exponentially 
many terms. We estimate this using Monte Carlo simulations which asymptotically converge (by 
the Law of Large Numbers) to the correct probability. Obtaining error bounds on the rate of this 
convergence for a finite number of Monte Carlo iterations and how their number needs to scale 
in the size of the problem is an interesting theoretical question that we do not investigate in this 
paper. We empirically found that our estimates converged after 1000 Monte Carlo iterations for 
the tested cases. 

Similar to previous section, one can minimize the download time as follows 

minimize (n - n sdf - E[A])uj bs + E[A]u mD (18) 



where ojd2D and lobs are defined in (13 (.Given that users stores files F, n se if = Zlf eF A 



where f h . is the probability that agent j requested its own stored file hj. Thus, we have 

n self = £ Pr[K = k] J2 Pr ^] E A 

k He{l,...,m} k hj^H 
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V. EVALUTAIONS BY COMPUTER EXPERIMENTS 

In this section, we provide some numerical results to investigate the effects of system param- 
eters on the optimal collaboration distance for deterministic and random caching strategies. In 
all figures except those considering the effect of the parameters, the number of users in the cell 
is 500 and the number of files is 1000. 

A. Deterministic caching 

In this subsection the performance of the deterministic caching is studied. First, we want to 
find the optimal collaboration distance which maximizes the average number of active clusters. 
In this optimization problem, we face with a trade-off between two factors: First, the smaller the 
collaboration distance, the higher frequency reuse. Second, the larger the collaboration distance, 
the higher probability of finding the requested files stored in the same cluster. 

Figure [3] shows the average number of active clusters versus the collaboration distance r, 
where the interference between clusters are ignored. We can observe that r opt decreases as 7 r 
increases. This agrees with the following intuition: the larger j r , the higher the probability that 
a (popular) file can be found very close to the requesting device, and the smaller the clusters 
can be made. 
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Fig. 3. The average number of active clusters versus the collaboration distance for deterministic caching with n = 500 and 
m = 1000. 
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Next we study the effect of our simplifying assumptions. We analyze by simulations a system 
that does take into account the inter-cluster interference, as well as pathless and shadowing. 
In this system, we aim to maximize the total average rate (note that maximizing average rate 
and maximizing number of active clusters might be different in such a more elaborate setting, 
while it is the same for the simplified system studied up to now). To provide a realistic channel 
model, we assume a standard power-law pathloss with path-loss exponent 2.6. Super-imposed 
on that pathloss is log-normal shadowing with a = AdB, which is assumed to be independent 
for all links. We do not include small-scale fading, assuming that it has been averaged out 
through (frequency-, time-, or antenna-) diversity. In order to include interference, we compute 
the optimal links, SIRs, and associated rates, as follows: in each cluster we assign files according 
to the deterministic distribution to the nodes, and generate file requests at random according to 
the Zipf distribution. We then establish all potential links in all the clusters. We now select active 
links from the set of potential links according to an iterative procedure: in a first round, each 
cluster schedules the links that has the highest received power. We next compute the interference 
computed by the scheduled transmitters at all receivers of potential links in the neighboring cells, 
as well as the SIR. If the scheduled link in one cell has a worse SIR than another link in that 
cell, we change the active link (which of course changes the interference in the neighboring 
cells). This procedure is repeated until no links change anymore. 

The total average rate versus r is shown in figure |4j Similar to the previous figure, the optimal 
collaboration maximizing the average rate, defined by r5, t , decreases as j r increases. Comparing 
figures [3] and |4} we can conclude that the optimal collaboration distance is slightly smaller when 
we take the interference into account. Intuitively, a smaller cluster size means that we have 
higher frequency reuse, but the probability of having empty clusters, which serve as a protection 
region for the interference, is higher. Therefore, in the presence of interference, a small cluster 
size is better, i.e., rfL. < r opt . Yet, we also observe that our (over) simplified model that forms 
the basis of our analytical treatment provides an optimal cluster size that is remarkably close to 
the optimal cluster size as obtained from the simulated rate optimization. 

From now on, we just present the simulation results for the average number of active clusters, 
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Fig. 4. The average rate versus the collaboration distance for deterministic caching with n = 500 and m = 1000. 



but similar explanations are valid for the average rate. In the remaining figures except those we 
want to consider the effect of 7 r , we set it to 0.6; this value is based on a study conducted on 



the University of Massachusetts Amherst campus in 2008 [15|. 

The effects of changing the popularity Zipf distribution exponent are investigated in Fig. [5] 
This figure shows the total number of clusters at r opt , called optimal total number of clusters 
(which is simply 2/r^J, and E(A) at r opt , called the optimal average number of active clusters, 
versus 7 r . We can observe from the figure that by increasing 7 r , the optimal total number of 
clusters and the optimal E[A] increases. For the small j r , there is a little redundancy in the 
users' requests, i.e., the probability of finding files in the CVC is generally small. Thus, to 
increase the chance of having D2D communication within a cluster, the collaboration distance r 
should increase. Hence, the optimal average number of active clusters decreases for small j r , as 
already explained above. In short, more redundancy in video requests results in higher spectral 
efficiency. 

In Fig. [6] the optimal collaboration distance achieved by two different optimizations is plotted 
versus 7 r . These two optimizations are based on maximizing the average number of active links, 
and minimizing the download time. For larger j r , the r opt minimizing the download time is 
lower than that maximizing E(A). Because there is a higher probability that a popular file is 
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Fig. 5. The optimal total number of clusters and the optimal average number of active clusters versus j r for deterministic 
caching with n — 500 and m = 1000. 



stored locally. The smaller the cluster, the more often this popular file is cached. This will thus 
increase the probability of a self-request and decrease the cluster size (compared to the case of 
maximizing the average number of active links). 



♦ - r optimizing the download time 
"— r optimizing the E(A) 




Fig. 6. The optimal collaboration distance versus versus 7 r for deterministic caching with n = 500 and m — 1000. 



Figure [7] shows the effects of the size of the library on the optimal total number of clusters 
and the optimal average number of D2D links. 

As the size of the library increases, users request from a larger set of files and there is more 
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diversity in requests. The probability that these diverse requests matches with the files stored in 
the CVC in the cluster decreases. So, by increasing m, users want to be surrounded by more 
neighbors in the clusters or equivalently to access to more files locally from the CVC. Hence, 
as it can be seen from figures, by increasing m, the optimal r increases and the optimal average 
number of active clusters decreases. 
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Fig. 7. The total number of clusters and the average number of active clusters versus the size of the library m for deterministic 
caching with y r — 0.6 and n = 500. 



Fig. [8] and Fig. [9] show the asymptotic behavior of the optimal collaboration distance and the 
number of active links as the number of users in the cell grows for two cases of 7 r = 0.6 < 1 
and 7 r = 1.4 > 1. We assume there is a library of size m = 30^ l /n (for a justification of this type 



of proportionality, see [21 1). For larger n, a user is surrounded by more neighbors given a fixed 
collaboration distance. In other words, users can find their desired files within a short distance 
which results in smaller optimal collaboration distance and larger number of active links. In our 
previous work, we have shown that for 7 r < 1 (resp., 7,. > 1), the optimal collaboration distance 
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(resp., n) where 77 = l^. Figure M and p\ confirm the asymptotic behavior of r opt (n) and E[A]. 



5 We use the standard Landau notation: f(n) = 0(g(n)) and f(n) — Q(g(n)) respectively denote \f(n)\ < cig(n) and 
|/(n)| > C%g{n) for some constants ci,C2. f(n) = Q(g(n)), stands for f(n) = 0(g(n)) and f(n) = Q(g(n)). 
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Fig. 8. The optimal average number of active clusters versus the number of users n for deterministic caching with 7 r = 0.6, 1.4 
and m = 30^n. E x = 0.5^- and E 2 = 0.35n. 
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Fig. 9. The optimal collaboration distance versus the number of users n for deterministic caching with "/ r — 0.6, 1.4 and 
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B. Random caching 

In this subsection we study the performance of the random caching. Here we have to optimize 
two parameters: (i) is the collaboration distance r, and (ii) the exponent of caching distribution, 

7c- 



Figures 10 shows the average number of active clusters versus the collaboration distance r 
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and the exponent of caching distribution, j c . As this figure shows and Fig. [TT] confirms, the 



optimal collaboration distance is equal to 0.2 and the optimal 7 C is equal to 1.5. 
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Fig. 10. The average number of active clusters versus r and 7 C for random caching with f r = 0.6, n = 500 and m = 1000. 
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Fig. 11. The average number of active clusters versus r for random caching for different 7 C , with "/ r = 0.6, n = 500 and 
m = 1000. 



The effect of system parameters on the optimal r is similar to the deterministic caching strategy 
and due to space limitation we omit the corresponding figures. 
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Figure [T2| shows how changing the request distribution exponent 7,. affects 7 Cj0pi , the optimal 
exponent of caching distribution maximizing the average number of active clusters. As it is 
obvious from the figure, by increasing 7,., 7 Cj0 pi increases. For the larger 7 r , the first few popular 
files account for the majority of requests. Therefore, to satisfy the users' requests, there is no 
need to cache less popular files and larger 7 C could be optimal. Another observation is that the 
optimal 7 C is not necessarily equal to 7 r and it is much larger than it. 
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Fig. 12. The optimal f c versus 7 r for random caching with r = r op t = 0.2, n — 500 and m — 1000. 



We compare deterministic and random caching in figures 13 15 and 16 These figures confirm 
that deterministic caching achieves better performance than random caching. This happens 
because in deterministic caching the cached files do not overlap. On the other hand, random 
caching is based on distributed placement and there is a chance of caching the same file in 
different devices; as a result the performance will decrease. 



Figure 13 compares the average number of active clusters versus the collaboration distance 



for deterministic caching and random caching. For random caching, the exponent of caching 
distribution, j c is assume to be the optimal value 1.5. As it is obvious from the figure, the 
optimal collaboration distance for both deterministic and random caching strategies are equal 
but the average number of active links is higher for deterministic caching, as expected. We also 
compare the average number of satisfied files which is summation of the average number of 
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D2D links and the average number of self-requests for deterministic caching, random caching, 
and the most-popular-caching-only strategies. In the most-popular-caching-only scheme each 
user stores the most popular file. Therefore, the average number of self-requests is maximized 
but users cannot have D2D communication because of the fact that all of them store the same 
file. As expected, the average number of self-requests for this simple strategy is independent of 



the collaboration distance. Figure 13 also shows that for large value of collaboration distance, 



E(A + n se if) is larger in the randomized caching. The reason is that due to randomness, the 
chance of self-requesting is higher in random caching than in deterministic caching. Therefore, 
the average number of self-requests is larger than that for deterministic caching. 
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Fig. 13. The average number of active clusters and number of self-requests versus the collaboration distance for deterministic 
and random caching (with optimal j c ) and one simple strategy where all users cache the most popular file, with <y T = 0.6, 
n = 500 and m = 1000. 



Fig. 14 shows the average number of self-requests versus the collaboration distance for 
different values of j r . This figure confirms that as the collaboration distance increase, the ratio 
of self-requests of the random caching to that of the deterministic caching grows and this is 
more significant for larger 7 r s. 



Figures 15 and 16 show respectively the effects of the size of the library and the number of 
users on the optimal average number of active links for both deterministic and random caching. 
The same arguments about the effects of n and m on the average number of active clusters as 
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Fig. 14. The average number of self-requests versus the collaboration distance for deterministic and random caching (with 
optimal 7 C ), for n = 500 and m = 1000. 



discussed in figures [7] and [8] are valid here. Figure [16] shows that for small number of users 
deterministic and random caching have very similar performance, but as n grows, the difference 
between their performance will increase. The reason is that when number of users grows, the 
probability of having overlap between cached files in random caching strategy will increase, 
therefore the improvement of its performance will be less than deterministic caching. 
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Fig. 15. The average number of clusters versus the size of the library m for deterministic and random caching with j r = 0.6, 
r = 0.2 and n = 500. 
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Fig. 16. The average number of clusters versus the number of users n for deterministic and random caching with -y r — O.i 
r = 0.2 and m = 1000. 



VI. Summary and conclusions 

The use of D2D communications has the capability of increasing the throughput of wireless 
video networks by orders of magnitude. When the transmission of the most popular files can 
be offloaded to D2D, which has high frequency reuse and thus high area spectral efficiency, the 
BS is freed up for providing rarely requested video files as well as non-video data. We analyzed 
both deterministic and random caching strategies, and pointed out that the latter is more realistic 
in networks with user mobility. 

While this paper presented a (simplified) system model and extensive simulations, much 
additional work remains to be done. The question of the optimal caching distribution, as well as 
strategies to learn the request distribution and bring the files into the cache, are of great practical 
importance. Furthermore, the subdivision of cells into clusters (with no communication across 
cluster boundaries) might not be optimal, and more general scheduling strategies are required in 
that case. 

Yet, our simulations have shown that even somewhat sub optimal clustering strategy can lead 
to orders of magnitude improvement of throughput for high user density in a cell. Our new 
concept thus presents a highly promising strategy for alleviating a key bottleneck in cellular 
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data transmission. 
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